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Abstract 

This work considers the identification of the available whitespace, i.e., the regions that are not 
covered by any of the existing transmitters, within a given geographical area. To this end, n sensors 
are deployed at random locations within the area. These sensors detect for the presence of a transmitter 
within their radio range r^, and their individual decisions are combined to estimate the available 
whitespace. The limiting behavior of the recovered whitespace as a function of n and is analyzed. 
It is shown that both the fraction of the available whitespace that the nodes fail to recover as well 
as their radio range both optimally scale as log (n)/n as n gets large. The analysis is extended to the 
case of unreliable sensors, and it is shown that, surprisingly, the optimal scaling is still log (n)/n even 
in this case. A related problem of estimating the number of transmitters and their locations is also 
analyzed, with the sum absolute error in locahzation as performance metric. The optimal scaling of the 
radio range and the necessary minimum transmitter separation is detemnined, that ensure that the sum 
absolute error in transmitter localization is minimized, with high probability, as n gets large. Finally, 
the optimal distribution of sensor deployment is determined, given the distribution of the transmitters, 
and the resulting performance benefit is characterized. 
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Index Terms 
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I. Introduction 

Whitespace identification, or determining the regions within a given geographical area of 
interest that are not covered by any of the existing transmitters, is useful in several applica- 
tions [[T|-[[3|. For service providers, it is important for finding dead zones, i.e., the coverage 
holes, in their service area. For cognitive radio (CR) networks, knowledge of the available 
whitespace is crucial in order to ensure that the CR nodes do not cause harmful interference to 
the primary transmitters. This paper addresses this problem using an approach where n sensors 
are deployed in the geographical area of interest. Their collective observations regarding the 
presence of a transmitter in their radio range are used to estimate the available whitespace. 
A challenge here is to determine the optimal scaling of the radio range and the minimum error 
in the estimated whitespace, as a function of the number of sensors deployed. 

In the recent literature, various approaches for whitespace identification have been considered. 
One approach uses the receive signal strength (RSS) measurements obtained from the sensors 
to estimate the number of transmitters and their powers by minimizing the sum of the MSB 
in the location and power estimates f^. The problem of localizing a transmitter using binary 
observations instead of using analog RSS measurements has also been considered [[5|-[|7). 

A related problem that has received considerable research attention in recent years is that 
of tracking one or more targets with the help of multiple sensors (see [8] for comprehensive 
review of the literature). More specifically, the problem of target tracking with binary sensor 



measurements has been studied both theoretically [[8j-[ll| and experimentally [12|. In [[8|, one 



or more targets that are located arbitrarily in the field of interest are tracked using multiple 
sensors, while in [|9|, pT| , binary sensor measurements are used to decide on the presence 
or absence of targets at given locations. However, to the best of our knowledge, there have 
been no studies in the literature on the limiting behavior of whitespace recovery methods as 
the number of sensors deployed is increased. Of particular interest are questions related to the 
optimal scaling of the radio range of the sensors to achieve the minimum whitespace recovery 
error, the resulting recovery performance, the optimum spatial distribution of sensors, etc. 

In this paper, we consider a scenario where n sensors are deployed in a given geographical 
area for whitespace identification. For simplicity, and without loss of generality, we consider 
the unit interval C = [0, 1] as the area of interest. The n sensors are deployed uniformly at 
random locations in C These sensors detect the presence of a transmitter within a distance 
from their location, and return a 1 if there is at least one transmitter in their vicinity, and return 
otherwise. The total whitespace recovered is the union of the 2rs -length areas around the 
sensors that return 0. Our contributions are: 

1) We show that both the whitespace recovery error (loss), i.e., the fraction of the available 
whitespace that is not recovered by the n sensors, and the radio range both optimally 
scale as log{n)/n as n gets largej^ 

2) We extend the analysis to the case where the sensors report erroneous measurements 
with probability p. Surprisingly, even with unreliable sensors, the optimal scaling of the 
whitespace recovery error and the optimal radio range is still log(r;,)/?T,. 

'ah logarithms in the sequel are to the base e. 
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3) We also consider the problem of transmitter localization, i.e., that of determining the 
number of active transmitters and their locations. With the sum absolute error in transmitter 
localization as the metric, we derive the optimum radio range as well as the minimum 
separation between transmitters that guarantees that the localization error is below a 
threshold with high probability. 

4) For a given spatial distribution of the transmitters, we derive closed-form expressions for 
the optimal spatial distribution of sensors that minimizes the probability of not detecting 
a transmitter and the resulting minimum miss -detection probability. 

We validate our analytical results through Monte Carlo simulations. The simulation results also 
illustrate the significant performance improvement that is obtainable by using the optimal scaling 
for Tg, as the number of sensors n is increased, compared to using a slower or faster decrease 
of Tg with n (see Fig. |4]). Moreover, even though the results are true for large n, the scaling of 
Ts = \og{n)/n is optimal even at moderate or low values of n. 

Under a similar binary observation model in a 2-dimensional region with fixed sensor place- 
ments, it has been shown in [8| that the expected whitespace identification error scales as 
where p is the density of sensors and is the radio range. In [8], however, is assumed to 
remain fixed as p is increased, i.e., the results do not hold if is allowed to decrease as p 
increases. In this paper, with p = n, we are interested in optimal scaling of with n, as p is 
increased. Further, in our model, sensors are placed randomly in the region of interest and we 
obtain results that hold with high probability. We show that the optimal radio range scales as 

= 9 (^^), and we get a localization error of 6 (^^) with high probability. 

All of our results directly extend to 2-dimensional regions, with the quantities of interest such 



as the optimal radio range, optimal transmitter localization error, etc. being the square-roots of 
their counterparts in the one-dimensional case. Our results yield useful insights into the number 
of sensors to be deployed and their radio range for detecting transmitters that maximizes the 
recovered whitespace and accurately localizes the transmitters within a given geographical area. 
The organization of this paper is as follows. In Sec. |n[ we present the modeling assumptions 



and problem setup. In Sec. Ill we derive analytical results on the whitespace identification 



when the sensors are perfectly reliable. Section IV extends the results to the case of unreliable 



sensors, and Sec. |V] extends the results to jointly identifying the number of transmitters and 



their locations, with the sum absolute error in localization as the metric. Section VI presents 



the optimum distribution of sensors that minimizes the probability of missing a transmitter. 



Simulation results are presented in Sec. VII and concluding remarks are offered in Sec. VIII 



II. System Model 

We consider a unit length segment L, wherein M transmitter^ are arbitrarily located. We 
assume that n sensor are thrown uniformly at random locations on £. Each sensor has radio 
range rs{n), i.e., it can detect the presence of any transmitter that is at most rs{n) distance away. 
Each sensor returns one of two possible readings b, b = 1 if there is at least one transmitter at 
a distance of rs(n) from it, and 6 = otherwise. The sensor readings are combined at a fusion 
center to find the region ^void of C that is guaranteed not to contain any transmitter. Now, 
if Xi,Xi, . . . ,Xn are the sensor locations in C and 61,62, ■ ■ ■ ,bn are the corresponding sensor 

^In the sequel, we will interchangeably use the phrases "transmitters" and "primary transmitters" to refer to the transmitters 
whose locations and transmission footprints we wish to determine. 
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readings, then 

n 

Aoid = [J(l - bi)[mm{xi - rs,0),max{xi + r^, 1)]. (1) 

i=l 

Let i{A) = /^g^ da; denote the length of a region denoted by A. For example, if A is the union 
of a finite set of disjoint regions, then i(A) is the sum of the lengths of the disjoint regions. We 
define Avoid = ^(-^void) as the length of the region where no transmitter is located. Note that, 
since the transmitters are located at distinct points that occupy no area, we would expect that, 
as n — )■ oo, the transmitters are perfectly localized, and, Avoid — ^ 1- Hence, formally speaking, 
we want to find the minimum e{n) and a corresponding radio range rs{n) that guarantees that 

min lim P ((1 - Avoid) < e{n)) = 1. (2) 

rs(n),e(n) n— s>oo 

The probability in the above equation is evaluated over the distribution of the sensor locations, 
where the transmitter locations are assumed to be fixed but arbitrary. This metric essentially 
captures the scaling of the relative loss in recovering the whitespace, with increasing number 
of sensors, as a function of radio range rs{n). So, there are two problems to solve, i) finding 
the minimum scaling of the error e(n) with n, and ii) finding the optimal radio range rs(n) as 
a function of n. 

We note that, in addition to solving ([2]), we may also wish to find the number of transmitters 
and their locations. Specifically, given an estimate of the number of transmitters and their 
locations, if we assume that each transmitter has a transmission range of rp, then the available 
whitespace consists of the region of C from which the subsets of size 2rp around each transmitter 
have been removed. We discuss how to determine the number of transmitters and their locations 
in Sec. [Vj The quantities and r-p may be different, as the sensitivity of the sensor may be 
different from the sensitivity of the receiver to which the transmitter's signal was intended. 



The next section presents fundamental bounds on the whitespace recovery error and the 
corresponding optimal radio range, asymptotically in n, when the sensors are perfectly reliable. 

III. Reliable Sensors 

We first present a lower bound on the error e*{n), that guarantees that if lim„_5.oo — ^ 0, 
then lim„_^oo P ((1 — ^void) < ^ c, where c < 1 is a constant independent of n. In other 
words, we show that it is not possible to recover the available whitespace with an error smaller 
than e*(n) with arbitrarily high probability as n gets large. To derive the lower bound, we will 



use the following result from the 1-coverage problem in one dimension [13|. 

Lemma 1: Let n sensors be thrown uniformly at random locations on the one-dimensional 
unit- length segment C, where each sensor has radio range of r{n). A point x on £ is defined 
to be covered if there is at least one sensor in the interval [x — r(n), x + r{n)]. Then, if 
lim„_>.oo — !■ 0, then lim„_>.oo P(all points in C are covered) < C2, where C2 < 1 is a constant. 

n 

A similar result holds in 2-dimensions, with ^'jj^ — ?■ 0, where a point is said to be covered if 
there is a sensor in a radius r{n) around it. 

Theorem 1: For the whitespace recovery problem in a one-dimensional unit- length region, if 
lim^^oo ^ or Um„_,oo ^ -> 0, then P ((1 - Avoid) < e{n)) < 1. 

Proof: Consider the case of a single transmitter, i.e., M = 1. Letting M = 1 can only give a 
weaker lower bound. However, we will show that the lower bound is tight in Theorem[3} Then for 
(1 — Avoid) < e(^) to hold, we need either (i) for e{n) < rs{n), at least one sensor in both inter- 



vals 



e{n) 



X — rs{n),x — rs{n) + ^ 



and 



X + rs{n) — X + rs{n) 



, where sensors in both the 



intervals give reading 1, we call this event A, or (ii) at least one sensor in the following four inter- 



vals, [x — rs{n),x\ ,[x,x + Tsin)] , x — rsiji) — ^y^, x — rsiji) 



X + rs{n),x + rsiji) + ^ 
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Fig. 1. Depiction of tiie uncertainty in transmitter location for tire lower bound. 

we call this event B, where the sensors in the first two intervals give readings 1, while the sensors 
in the last two intervals give readings 0. We illustrate the events A and B in Fig. [T| Since the 
transmitter can be arbitrarily located, x can be anywhere in C. Thus, essentially, we need all 
intervals of length rs{n) and e{n)/2 to have at least one sensor. 

To find the probability that all intervals of length rs{n) and e{n)/2 have at least one sensor, 
we use Lemma [1} Note that the setting in this Theorem is identical to Lemma [TJ where n sensors 
(in place of transmitters) with radio range rs{n) are thrown randomly in C. From Lemma [T] 
we know that if Tsin) is less than order then lim^^oo -P(each point in C is covered) < C2, 
where C2 < 1 is a constant. If any point on C is not covered, then surely the interval of length 
2rs{n) around it has no sensor. Hence, if 2rs{n) is less than order then from Lemma [l] 
there exists an interval of width 2rs{n) that does not have any sensor with probability greater 
than 1 — C2. Similarly, if t{n) is less than order there exists an interval of width e(n) that 
does not have any sensor with probability greater than 1 — C2 from Lemma [1] thereby violating 



the conditions for having (1 - A^id) < e{n). Thus, if lim„_^oo ^ -> or hm„^oo 0, 

n n 

then P((l -Avoid) < e(^)) < 1. ■ 
The result for the 2-dimensional region is as follows. 

Theorem 2: For a 2-dimensional region, if lim„^oo -t- or lim^-^oo ^7^% ~^ 0' '^hen 

P((l-^void)<eH)<l. 

Proof: Similar to the proof of Theorem [1} using the 2-dimensional part of Lemma [T] ■ 

Next, we show that if rs{n) = 9 (^^) and e{n) = (^^), then the lower bound on 
the whitespace recovery error obtained in Theorem [T] is tight. For the proof, we will need the 
following Chemoff bound result. 

Lemma 2: Let Xi, X2, . . . be identical and independently distributed Bernoulli random vari- 
ables with mean yU = E{Xj}, and let X = Xlj'Li^j- Then for < 5 < 1, we have that 
PiX <{l-5)fi)<exp 

Theorem 3: If r,(n) = 6 (^), then for e(n) = 6 (^), P (1 - A^id < eH) ^ 1. 
Proof: Let rs(r?,) = ( '^'°^" ), where c > 1 is a constant. Divide C into smaller non- 
overlapping intervals of length ( ^^°^" ), and index these segments as zi to n \. Let the num- 

V clog n y 

ber of sensors lying in Zk be A^^^. From the Chemoff bound in Lemma 2| P{Nk < ^^^) < n~'^/^, 
and taking the union bound, P{Nk < for any k) < c^n^''^^^, where C4 is a constant. Thus, 
for large enough c, with high probability, each small interval Zk contains at least sensors. 
Since there are M transmitters, at the maximum, only M smaller intervals among zi . . . z/ „ \ 

\ clog n / 

contain any transmitter. Since the radio range = ( ^^°^" ), a transmitter lying in interval 
Zk can only influence readings of sensors lying in 3 adjacent intervals Zk-i,Zk, and, Zk+i. 
Therefore, there are at least ( ) — 3M intervals among 2:1, ... , Z( _n\ in which all sen- 
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sor readings are 0. Thus, an area of width ^^^n^) ~ '^-^) (^^^) contains no transmitter, 
i.e. Aoid > ((^) -3m) (^) = 1 - 3M(^). Therefore, with high probability, 
(1 - Avoid) < 3M (^), proving the Theorem. ■ 

Remark 1: The result for the 2-dimensional region follows similarly, with rs{n) = G 
and e{n) — G ^-y/^^^ • Further, all the results in the sequel are valid for 2-dimensional regions 
also, but we omit the formal statements to avoid repetition. 

In this section, we have shown that asymptotically in n, the optimal radio range scales as 
and the corresponding optimal whitespace recovery error also scales as For finding 
the lower bound, we leveraged the results on the coverage problem. Then, we used a Chemoff 
bound result to show that if radio range is order then each interval of width contains 
logn sensors with high probability, and, hence, we can get a whitespace recovery accuracy 
of order for large enough n. In this section, we assumed that the sensor readings were 
error-free. Surprisingly, the above results hold even when the sensors are unreliable, as we show 
in the following section. 

IV. Unreliable Sensors 

In this section, we assume that sensors are unreliable, and make an error in reading with 
probability p < ^ independently of all other sensors. Thus, a sensor reading is 1 even if there 
is no transmitter within a range of around it, or a sensor reading is even if there is a 
transmitter within a range of around it, and both events happen with probability p. In reality, 
the sensor errors could be a function of the distance from the transmitter, both in terms of 
missing a transmitter or identifying one when it is not present within the sensing radius r^. Let 
the upper bound on the miss probability or false alarm of any sensor be p over the sensing 
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radius r^. Then, our model, where each sensor makes an error with probability p, essentially 
takes care of the worst case scenario, while simultaneously simplifying the analysis. 

As before, we are interested in finding the minimum error e{n) and radio range rs{n) that 
solves the optimization problem 

min lim P ((1 - A^id) < e{n)) = 1. (3) 

rs(n),e{n) n-¥oo 

The following Theorem is the analog of Theorem [T| with unreliable sensors. Its proof follows 
simply because the lower bound with unreliable sensors cannot be better than the lower bound 
with reliable sensors derived in Theorem [TJ 

Theorem 4: When sensor measurements are in error with probability of error p < \, and for 
the whitespace recovery problem in a one-dimensional unit-length region, if lim„^oo 
or lim„^oo 0' *en P ((1 - A^id) < e{n)) < 1. 

n 

Next, we show that the lower bound in Theorem |4]is also achievable. The proof is constructive, 
and follows by proposing a reconstruction strategy and analyzing its error performance. 

Theorem 5: When sensor measurements are in error with probability of error p < ^, if 
r,{n) = e (^), then for e{n) = 9 (^), P (1 - ^oid < e{n)) 1 asymptotically in n. 

Proof: As in the proof of Theorem [sj let rs{n) = ( '^'°^" ), where c > 1 is a constant. Divide 
the segment C into smaller intervals of length ( '^'°^" ), and index these segments as zi to z/ „ \. 

\ c log n J 

Let the number of sensors lying in Zk be Nk. From the Chernoff bound, P{Nk < ^^^) < 

and taking the union bound, P{Ni. < for any k) < c^n^''^^^, where C4 is a constant. Thus, 

with high probability, for large c, each smaller interval Zk contains at least sensors. 

As before, since there are only M transmitters, at the maximum only M smaller intervals 
among zi . . . zr „ \ contain any transmitter. Since a transmitter lying in z^ can only influence 

V c log n ) 



12 

readings of sensors lying in 3 adjacent intervals z^-i, and z^+i- Therefore, in reality at least 
( ) — 3M intervals among zi . . . z/ „ \ should have all sensor readings as 0. However, 

Y c og n y \clogn J 

because of errors in sensor readings, some of the sensors in these intervals have readings 1 
instead of 0. To resolve this problem, we use the majority rule to decide whether an interval Zk 
contains a transmitter or not. Thus, a transmitter is declared to be present in an interval z^, if 
the number of sensors with reading 1 are more than the number of sensors with reading 0, and 
a transmitter is declared to be absent in an interval z^ otherwise. 

With this decision rule, Avoid = ^Ma]{zk)=o ^k, where the function Ma){zk) equals 1 if z^ has a 
larger number of sensors with a reading of 1 than with a reading of 0, and equals otherwise. 
Therefore, the probability of interest in this case is that of missed detection, Pmd, which is the 
probability that the majority of sensor readings in Zk is 0, given that there is a transmitter in 
an interval Zf^. Recall that each sensor makes an error with probability p independently of all 
other sensors. From the Chernoff bound, we know that in each interval Zk there are at least 
sensors with high probability, for large enough c. Let A^^ denote the number of sensors 
in Zk- Without loss of generality, we assume that A^^ is odd, as otherwise, we can consider 
one less sensor for making decisions. Then Pmd = J2'^1n^.+i ~ p)^''~^. From an 



upper bound known in coding theory for repetition codes [14|, ^ n^+i ( k)p^{^ — p)^'' ^ < 
^2a/p(1 — p)j . Hence, for p < ^, Pmd < ct^*, where a < 1, and Nk is of the order logn 
with high probability. Thus, the probability of missed detection P^d decreases exponentially 
with ^Mz?;. Since there are at the maximum j intervals in £, using the union bound, the 

probability that there is a missed detection in any one of the ( " ) intervals is ( ) a^^^ , 

-z •' \c log n I \c log n I ' 

/ \ C log Ti 

where a < 1. Thus, for c large enough, ( j — )• with high probability as n — > oo. 



13 

Using a similar analysis, we can show that the probability of false alarm in any interval 
Zk, i.e., the probability that the majority of sensor readings in Zk is 1, given that there is no 
transmitter in an interval Zk, goes to zero as n goes to infinity. 

Thus, with high probability, we have that (^(^ ciogri ) ~ 3Mj intervals not containing any 
transmitter have their majority of reading equal to 0. Hence, ^void > ^ (^ ciogn ) ~ 3M^ (^^^) ~ 
1 - 3M (^) with high probability, proving the Theorem. ■ 

In this section, we considered the case when each sensor makes an error with probability p 
in the detection of any transmitter within its radio range. The lower bound on the whitespace 
recovery error is the same as in the case of reUable sensors, since the error with unreliable 
sensors cannot be better than that with reUable sensors. For finding the matching upper bound 
on the whitespace recovery error, we let the radio range be of order so that each interval 
of width contained more or less log n sensors with high probability. Then, for each interval 
of width we proposed a majority rule for declaring the presence or absence of transmitter 
in that interval. Since there are a large number of sensors (roughly logn) in each interval, if 
p < 1/2, it followed from a repetition code argument that the probabilities of false alarm and 
missed detection go to zero for large n. Thus, we showed that, if the radio range is such that 
there are enough number of sensors in each small interval, asymptotically, the lack of reliability 
of the sensors has no effect on the whitespace recovery error. 

Note that, in the whitespace identification problem discussed above, we used the locations of 
sensors that returned a measurement to find the available void space that is guaranteed to not 
contain any transmitter. In the next section, we consider the problem of estimating the number of 
transmitters and their locations, with the absolute error in locating the transmitters as the metric 
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of interest. We show that when the number of transmitters is unknown, a localization error of 
^-^^^ can still be achieved in the large number of sensors regime, provided the transmitters are 
known to be at a minimum separation of the order i^^^M from each other. 

V. Transmitter Localization 



In Section III we considered the problem of finding the whitespace ^void that contains no 
transmitters using binary sensors randomly deployed in the area. In addition to finding the void 
space area, there are several related problems of interest, such as finding the received power 
profile that describes the received power from the transmitters at each point of the given area 
£, finding the number and locations of the transmitters, etc. Towards that end, in this section, 
we are interested in finding how many transmitters are present and their locations on C using 
binary readings from the n sensors that are uniformly randomly distributed on C. 

As in the previous section, each sensor is assumed to have sensing radius ra{n), and has 
two possible readings, 1 if there is at least one transmitter at a distance of from it, or 
otherwise. For simplicity, we consider the case of reliable sensors in this section. Results with 



unreliable sensors follow similarly, as in Sec. IV To estimate the number of transmitters and 



their locations, we note that each disjoint region containing sensors that returned the value 1 
contains at least one transmitter. Hence, we estimate the number of transmitters to be equal to 
the number of disjoint regions containing sensors that returned the value 1, and we estimate 
the transmitter locations Xi to be the geometric centroid of each such region. Note that, any 
contiguous region containing sensors that returned the value of 1 could potentially have more 
than 1 transmitterj^ This could lead to errors in estimating the number of transmitters and/or 

^In particular, if the region is of widtli greater tlian 2rs, tlien it must necessarily contain more tlian one transmitter. 
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their locations, as there is no way of identifying the number of transmitters within regions 
containing sensors that measured a 1. To overcome this, in this section, we assume that any 
two transmitters are at at least S{n) > distance apart. As we will see, under mild assumptions 
on 6{n), it is possible to correctly estimate the number of transmitters and their locations with 
high probability, asymptotically in n. 

Let there be M transmitters on C, and the true location of transmitter j be Xj. Let the 
estimate of M, the number of transmitters, be M, and the estimate of the location of the i^^ 
transmitter using the n sensor readings be Xi,i = 1,2, .. . ,M, as described above. For both 
the true locations and their estimates, we index the transmitters from left to right on C, such 
that Xi < X2 < ■ ■ ■ < xm and Xi < X2 < ■ ■ ■ < Xj^j. Then we define the error metric as 
^max{M,M} |_^^ — Xi\, whcrc by definition we have that for M < M, Xj = 0, z = M + 1, . . . , M, 
and for M > M, Xi = l,i = M + 1, . . . , M . This metric penalizes a mismatch between the 
actual and estimated number of transmitters, in addition to the error in localizing them. 

We are interested in finding minimum error e{n), transmitter separation 5{n) and radio range 
rs{n) that guarantees that 

(max{A/,A^} \ 
y \xi — Xi\ < e{n) = 1. (4) 
i=i j 

The next Theorem characterizes the lower bounds on t{n), rs{n) and S{n) required for high 
probability estimation of the number and locations of the transmitters. 

Theorem 6: If lim„_i.oo — ^ 0, then P / ^™a^{^^>*^} j^,. _ ^.| < e{n)\ < 1. Similarly, if 
lim„^oo ^ 0, or nm„_oo |M then P [^-f^'*} _ < e(n)) < I. 

Proof: First, the bounds on t{n) and Tsin) follow from Theorem [T] with M = 1. For 
the lower bound on 5{n), consider M = 2 transmitters at locations xi and X2 with distance 
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\x2 — Xi\ = S{n) between them. To be able to decide that two transmitters are present, i) at 
least one sensor has to he in between xi and X2 with a reading of 0, or ii) rs{n) has to be less 
than or equal to 5{n), since otherwise the sensors lying outside of the interval {xi,X2) cannot 
discern whether there are two transmitters or one, as both xi and X2 will possibly be in their 
range r^. 

Since the two transmitters can be arbitrarily located on C, condition i) implies that each 
interval of length 6{n) on C should contain at least one sensor. Similar to the proof of Theorem 
[Tj the probability that each interval of length S{n) contains at least one sensor is upper bounded 
by a constant less than 1 if lim„_^oo ts^I = 0. We already know that, for lim„_>.oo — )■ 

n n 

0, P ^^™3Jc{M,M} _ ^ e{n)^ < 1 . Thus, conditions i) and ii) together imply that for 
lim^^oo ^ = 0, P / ^max{M,M} -Xi\ < e{n)j < 1. ■ 

Our next result shows that e{n) = rgin) = 6{n) = 6 (^^) is sufficient for estimating the 
number and location of transmitters with high probability, asymptotically in n. 

Theorem 7: If r^in) = 6{n) = e{n) = 6 (^), P \x, - Xi\ < e(n)) ^ 1. 

Proof: Let rs{n) = c > 1, and let the minimum transmitter separation 6{n) = 

where d > 10c. Divide the region C into smaller intervals of length ( ^°'^^°^" ), and index these 
segments as zi to Z( „ \. Hence, each interval contains at most one transmitter. 

V 10c log n ) 

Partition each Zi into five equal parts of width (^^7^) , and index them with Pj 1, . . . , Pj 5. Let 
the number of sensors lying in Pj ^ be Nij. From the Chemoff bound, P{Nij < clogn) < n'^^^, 
and taking the union bound, P{Nij < clogn for any i, j = 1, . . . ,5) < c^n^'^^'^, where C4 is 
a constant. Thus, with high probability, each partition of each interval contains at least clogn 
sensors for large enough c. 



lOrs(n) 




k,l 'k,3 ^k,4 ^k,5 



2r,(n) 
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Fig. 2. Worst case placement of transmitters for estimating tlieir locations. 

Consider any interval z^. If all the sensor readings in are zero, or if only the readings of 
the sensors lying in left half of i or right half of Pfc 5 are 1, then no transmitter lies in Zk. 
Otherwise, we know that there is a transmitter lying in z^, say Xj. Note that, it is hardest to 
detect the location of transmitter lying in Zk if there are transmitters in both intervals z^^i and 
Zk+i, and they lie closest to the boundary of Zk, as shown in Fig. [2} where black dots represent 
the transmitters. Let Xi G Pk.j. Then, an interval Wi of width ( ^'^'°^" ) around Xj contains at 
least ( '^^°^" ) sensors with high probability from the Chemoff bound, and all these sensors have 
reading 1. In addition, irrespective of the index j of partition P^j to which Xi belongs, there 
exists /m, G {1, • • • , 5} for which all sensors lying in the partition Pk,i,„ have a reading of 0, 
since all the sensors lying in Pk,l,r^ ^ distance greater than the radio range ( '^'°^" ) from 

the transmitter Xj in P^j. For example, in Fig. [2| all sensors lying in Pfc4 have their reading 
equal to 0. Hence, using the readings from sensors in Wi and Pfc,z„, we can identify the location 
of the transmitter located inside it, and the uncertainty about the z*'* transmitter location is no 
more than two times the width of any partition Pk,m- This is equal to C^"^'"^" ), and hence, 
\xi — Xi\ < ( ^^'°^" ). Since this is true for each transmitter i, M = M, the total localization 



Ema> 



max{Af,M} 



Xi Xi 



< Y^Li (^^) < M with high probability. This concludes 



the proof. 
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In this section, we considered the problem of estimating both the number of transmitters 
as well as their locations using n sensors making binary measurements. We showed that not 
knowing the number of transmitters does not significantly change the localization error as long 
as the minimum separation between any two transmitters is of order We first showed that if 
the minimum transmitter separation is less than order then the localization error probability 
cannot go to zero. Conversely, with the minimum transmitter separation of order using the 
Chemoff bound we showed that if the radio range is of order we can partition £ into small 
enough intervals so that with high probability, no interval contains more than one transmitter, 
while simultaneously ensuring that there are enough sensors in each interval for detection of 
the transmitter with high probability. The main message of this section is that if minimum 
separation between transmitters scales as then transmitter localization problem is invariant 
to the knowledge of the number of transmitters. For a practical scenario where transmitters are 
geographically separated, minimum separation requirement for our results is easily satisfied and 
hence the whitespace or received power profile can be detected efficiently. 

Remark 2: Another transmitter localization problem of interest is when the number of trans- 
mitters M scales with the number of sensors n. Theorem [v] suggests that if M{n) scales such 
that the minimum distance between any two transmitters scales no faster than order then 
a localization error of M(n)^^ can be guaranteed with high probability. So, clearly, for 
M{n) = O ^^1^^ , where the minimum distance between any two transmitters scales no 
faster than ^ [11], we have that the locaUzation error scales as ^/db^. i-^-, as - 



log n n 

Thus, our results also extend to the case where the number of transmitters scales with n, under 
certain conditions. 
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VI. Optimum Distribution of the Sensor Locations 

Thus far, we assumed that the transmitters are arbitrarily located on £, and obtained bounds 
on the minimum localization error and the optimal sensing radius in the worst case scenario. 
In some scenarios, it may be possible to obtain the spatial distribution of the transmitters on £, 
either as side information from the primary network or from long-term statistics collected by 
the sensors. In this section, we consider the optimization of the spatial distribution of the sensor 
locations. We assume that the transmitters are distributed over L = [0, 1) with pdf fx{x), and 
seek to find the optimum sensor distribution fx{x) over C that minimizes the probability of 
missing a transmitter. Mathematically, we wish to solve 

Pf= min [ {l-2rsfx{x)rfx{x)dx. (5) 

fxixy.J^ fx{x)dx=lJo 

In the above, given that the location of transmitter is x, 2rsfx{x) is the probability (for small r^) 
that there is a sensor in an area 2rs around it. Hence, (1 — 2rsfx{x))"' represents the probability 
that none of the sensors lie within the sensing range of the transmitter. By averaging over the 
distribution of x, Pf captures the probability that all the sensors have reading 0, and completely 
miss the transmitter at a random location in C. Using elementary results from variational calculus 



[15 1, the optimum fx{x) must satisfy 

- n (1 - 2rJx{x)r-' fx{x)2r, + /i = 0, (6) 
where /i is a Lagrange multiplier factor, and is chosen such that fx{x)dx = 1. This leads to 

U^)=\^-i^r-^XA^. (7) 



2nrsfx{,x)J J 2rs 

In the above, fxix) is taken to be for x such that fx{x) = 0, and when the right hand side is 
negative. In some cases, the above reduces to intuitively satisfying results. For example, when 
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= 1, < X < 1, the above implies that fx{x) = 1,0 < x < 1, i.e., the optimum density 
is also uniform. On the other hand, when n = 1, i.e., when only one sensor is deployed, fx{x) 
drops out of and hence, interestingly, any point density that is continuous and nonzero on 
[0, 1) performs equally well. 

The value of fi that ensures fx{x)dx = 1 has to be obtained using numerical techniques. 
This is not difficult, since the right hand side in Q is monotonically decreasing in /i, taking 
the value l/2rs > 1 when fi = 0, and taking the value as /i gets large. Thus, any simple 
numerical technique such as the bisection method can be used to find the value of /i. 

Now, substituting the optimum fx{x) into ^ and simplifying, we get 

Joifxix)) -idx 

We recognize the denominator as the ip norm of 1/ fxix), with p = l/{n—l) (which is in fact 
a quasi-norm). Note that, substituting the uniform point distribution for fxix) = 1,0 < x < 1 
in ([5]) results in pj""^-' = (1 — 2rs)". Thus, the performance improvement from the optimized 
point density depends on the magnitude of the denominator in ([8]). For example, consider the 
case were X has a triangular distribution: fx{x) = 4a; for < x < 1/2, and = 4(1 — x) for 
1/2 < X < 1. With some algebra, it can be shown that ([8]) reduces to 

,op„ ^ 2(1 - 2r.r ^ 2(1 - 2r.r 

Thus, the optimum point density does improve performance over the uniform point density, but 
both scale as (1 — 2rs)'^ with n. When = {\ogn)/n, for large n, (1 — 2rs)" ^ l/n"^, i.e., the 
probability of missing a transmitter is inversely proportional to n^. 
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VII. Simulation Results 

We now present Monte Carlo simulation results to illustrate the analytical results developed 
in this paper. We consider M transmitters and n sensors deployed uniformly at random locations 
over C = [0, 1). Sensors return a 1 if there is a transmitter within the sensing radius around 
them, and return otherwise. We identify the whitespace as the total area spanned by the 
2rs regions around sensors that return 0. To estimate the number of transmitters and their 
locations, we first identify the occupied space as the union of the 2rs regions around sensors 
that return 1. Then, for each contiguous occupied region of width smaller than 2rs, we identify 
one transmitter at the center of the region. In contiguous occupied regions of width greater 
than 2rs, we identify Lwidth/2rsJ transmitters, placed uniformly in the region. We compute the 
probability of the whitespace recovered exceeding 1 — t{n), i.e., the objective function in ([2]), 
with e(n) = {\ogn)/n, and the probability of the transmitter localization error as in (|4]), by 
averaging over 10, 000 instantiations of transmitter and sensor deployments. 

Figure [3] shows the probability of the whitespace recovered exceeding 1 — e{n), i.e., the 
objective function in (|2]), versus the number of sensors n, with M = 1 and 4 transmitters and 
e(n) = log(n)/n. We see that rs{n) = log(n)/n outperforms the other scaling factors, which is 
in line with the result in Theorem |3j In Fig. |4} we plot the probability that the sum absolute error 
in localizing the transmitters is < e{n), given by Q. We set e{n) = log(n)/n, and compare 
the performance of three different scalings for r^: \og{n)/n, (log(n)/n)^, and ^y\og{n)/n, for 
M = 1 and M = 4 transmitters. We see that \og{n)/n captures the optimal scaling of the radio 
range with n, and it significantly outperforms the other scalings considered. Moreover, even at 
moderate or low values of n, scaling rs{n) at a rate that is higher or lower than log (n)/n results 
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Fig. 3. Probability that the whitespace recovered is > 1 — ^{n), with e(n) = log(n)/n. 
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Fig. 4. Probability that the sum absolute error in localizing the transmitters is < e(n), with e(n) = log(n)/n. 

in a significant degradation in the performance. 

Finally, Fig. |5] shows the probability of missing a transmitter uniformly distributed on [0, 1), 
and n sensors with sensing radius rs{n) = \og{n)/n are deployed according to the triangu- 
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Fig. 5. Probability of missing a transmitter, wlien n sensors with sensing radius ra(n) = log(n)/n are deployed according 
to the triangular, truncated Gaussian and uniform distributions. 

lar, truncated Gaussian and uniform distributions. For the triangular distribution, we consider 
f\{x) = Ax for < a; < 1/2, and = 4(1 — x) for 1/2 < x < 1. For the truncated Gaussian 
distribution, we consider the Gaussian distribution with mean 0.5 and standard deviation 0.25, 
truncated to [0,1]. We see that, as expected, the uniform distribution outperforms the other 



distributions, and its performance matches with the P/ ^ Xjr? result derived in Sec. VI 



VIII. Conclusions 

In this paper, we studied the recovery of whitespace using n sensors that are deployed at 
random locations within a given geographical area. We derived the limiting behavior of the 
recovered whitespace as a function of n and and the sensing radius r^, and showed that both 
the whitespace recovery error (loss) and the radio range optimally scale as log(n)/n as n gets 
large. We also showed that, surprisingly, the radio range scaling of log(n)/n is optimal even 
with unreliable sensors. Using the sum absolute error in transmitter localization as the metric, 
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we also analyzed the optimal scaling of the radio range that minimizes the localization error 
with high probability, as n gets large. We also derived the corresponding optimal localization 
error, and showed that it scales as log(n)/n as well. Finally, we derived the optimal distribution 
of sensors that minimizes the probability of missing a transmitter, for a given distribution of the 
transmitters, and analyzed the behavior of the miss detection probability as n is increased. Our 
results yielded useful insights into the number of sensors to be deployed and the radio range 
for detecting transmitters that maximizes the recovered whitespace and accurately localizes 
the transmitters within the given geographical area. Future work could involve extending the 
sensor detection model to account for temporal and spatial variations in the signal power due 
to shadowing, multipath fading, transmitter movement, etc. 
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